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Amendments to the Claims : 

This listing of claims replaces all prior versions and listings of claims in the application: 

1 . (Currently Amended) A method of comparing the semantic content of two or 
more documents, comprising: 

accessing two or more documents; 

performing a linguistic analysis on each document; and 

defining outputting a semantic vector for each document based on the linguistic 
analysis , said semantic vector having multiple components, wherein each component of 
said semantic vector has at least: 

a term included word or phrase appearing in the document or a 
synonym of sai d term; word or phra s e ; 

a weighting factor relating to an importance of said term associated 
with said word or phrase or synonym ; and 

a frequency value relating to a number of occurrences of said term . 

2. (Previously Presented) The method of claim 1, wherein the linguistic analysis 
comprises sentence analysis. 

3. (Previously Presented) The method of claim 2, wherein the sentence analysis 
comprises a syntactic analysis and a semantic analysis. 

4. (Canceled) 



5. (Canceled) 
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6. (Currently Amended) The method of claim 1 , wherein each component of the 
semantic vecto r for at least one of the documents comprises can have multiple 
dimensions. 

7. (Currently Amended) The method of claim 1 , wherein each component of the 
semantic vecto r for at least one of the documents further comprises a subordinate 
concept value. 

8. (Canceled) 

9. (Canceled) 

10. (Currently Amended) The method of claim 1, wherein some o f the components 
of the semantic vector for at least one of the documents have ||{Hmain term - 
subordinate term pairs[[}]] as their first value. 

1 1 . (Currently Amended) The method of claim 1 , wherein the semantic vector 
comprises [[is]] a multi-dimensional vector defined by the content of a semantic net. 

12. (Previously Presented) The method of claim 11, wherein the content of the 
semantic net is augmented by relative weights, strengths, or frequencies of occurrence of 
the features within the semantic net. 

13. (Currently Amended) The method of claim [[1]]37, wherein [[the]]an output of 
said defined algorithm is a measure of at least one of semantic distance, semantic 
similarity, semantic dissimilarity, degree of patentable novelty and degree of anticipation. 
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14. (Currently Amended) A method of comparing two or more documents, 
comprising: 

linguistically analyzing two or more documents to identify at least one term 
group in each document, each term group comprising a main term and at least one 
subordinate term semantically related to the main term ; 

generating a semantic vector associated with each document , the semantic vector 
comprising a plurality of components, each component including: 
a term group in the document; 

a frequency value relating to a number of occurrences of the term 

group; and 

a weighting factor relating to an importance of at least part of the 

term group ; and 

comparing the semantic vectors using a defined metric, wherein said metric 
measures the semantic distance between two documents as a function of at least the 
frequency values included in the semantic vectors for the two documents, t he 
relative frequencies of common terms and of common {main term - subordinate term 
pair) between the two document s . 

15. (Canceled) 

16. (Currently Amended) The method of claim 14, wherein [[a]] the main term 
common term between two documents includes two terms that arc synonyms of the 
main term . 



17. (Previously Presented) The method of claim 14, wherein one or more of said 
two or more documents are located using an autonomous software or 'bot program. 
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18. (Currently Amended) The method of claim 17, wherein the 'bot program[[:]] 
automatically analyzes each document in a defined domain or network by executing a 
series of rules and assigning an overall score to the document. 

19. (Previously Presented) The method of claim 1 8, wherein all documents with a 
score above a defined threshold are linguistically analyzed. 

20. (Previously Presented) The method of claim 14, wherein the semantic vector is 
a quantification of the semantic content of each document. 

2 1 . (Currently Amended) The method of claim 1 4, wherein the semantic vector 
can have multiple components, and each component has can have multiple 
dimensions. 



22. (Canceled) 
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23. (Currently Amended) A system for comparing two or more documents, 
comprising: 

a document inputter, arranged to access two or more documents; 

a semantic analyzer, arranged to perform a linguistic analysis on each document 
to identify at least one term group in the document, each term group comprising a 
main term and at least one subordinate term semantically related to the main term ; 

a semantic quantifier, arranged to output a quantified representation of a semantic 
content of each document , the quantified representation based at least in part on: 
a term group in the document; and 

a weighting factor relating to an importance of at least part of the 

term group ; and 

a comparator, arranged to compare the quantified representations using a defined 
algorithm, wherein said defined algorithm measures the semantic distance between two 
documents as a function of at least the weighting factors associated with the 
quantified representations for the two documents, the relative frequencies of 
common terms and of common (main term - subordinate term pairs) between the 
two document s . 
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24. (Currently Amended) A system for comparing two or more documents, 
comprising: 

a document inputter, arranged to access two or more documents; 

a semantic analyzer, arranged to perform a linguistic analysis on each document 
to identify at least one term group in the document, each term group comprising a 
main term and at least one subordinate term semantically related to the main term ; 

a semantic vector generator, arranged to output a semantic vector associated with 
each document , each semantic vector comprising a plurality of components, each 
component including: 

a term group in the document; 

a frequency value relating to a number of occurrences of the term 

group; and 

a weighting factor relating to an importance of at least part of the 

term group ; and 

a comparator, arranged to compare the semantic vectors using a defined metric, 
wherein said metric measures the semantic distance between two documents as a function 
of at least the frequency values included in the semantic vectors for the two 
documents, the relative frequencies of common terms and of common {main term 
subordinate term pairs} between the two documents. 



25. (Canceled) 
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26. (Currently Amended) A computer program product comprising a computer 
usable medium having computer readable program code means embodied therein, the 
computer readable program code means in said computer program product comprising 
means for causing a computer to: 

access two or more documents; 

perform a linguistic analysis on each document to identify at least one term 
group in the document, each term group comprising a main term and at least one 
subordinate term semantically related to the main term ; 

output a quantified representation of a semantic content of each document , the 
quantified representation based at least in part on: 
a term group in the document; 

a frequency value relating to a number of occurrences of the term 

group; and 

a weighting factor relating to an importance of at least part of the 

term group ; and 

compare the quantified representations using a defined algorithm, wherein said 
defined metric measures the semantic distance between two documents as a function of 
at least the frequency values associated with the quantified representations for the 
two documents, the relative frequencies of one of common terms and common [main 
term - subordinate term pairs) between the two documents. 
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27. (Currently Amended) A computer program product comprising a computer 
usable medium having computer readable program code means embodied therein, the 
computer readable program code means in said computer program product comprising 
means for causing a computer to: 

linguistically analyze [[ing]] two or more documents to identify at least one term 
group in the document, each term group comprising a main term and at least one 
subordinate term semantically related to the main term ; 

generate[[ing]] a semantic vector associated with each document , each semantic 
vector comprising a plurality of components, each component including: 
a term group in the document; and 

a weighting factor relating to an importance of at least part of the 

term group ; and 

compare[[ing]] the semantic vectors using a defined metric, wherein said metric 
measures the semantic distance between two semantic vectors as a function of at least 
the weighting factors included in the two semantic vectors, the relative frequenci e s 
of one of common terms and common (main term - subordinate term pairs) between 
the two document s . 

28. (Previously Presented) The computer program product of claim 27, wherein the 
computer readable program code means in said computer program product further 
comprises means for causing a computer to: 

identify one or more of said two or more documents using an autonomous 
software or 'bot program. 

29. (Currently Amended) The computer program product of claim 28[[27] ] , 
wherein said 'bot program automatically analyzes each document in a defined domain or 
network by executing a series of rules and assigning an overall score to the document. 
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30. (Previously Presented) The computer program product of claim 27, wherein the 
semantic vector is a quantification of the semantic content of each document. 

3 1 . (Currently Amended) The computer program product of claim 27, wherein 
an[[the]] output of said defined metric is a measure of at least one of semantic distance, 
semantic similarity, semantic dissimilarity, degree of patentable novelty and degree of 
anticipation. 



32. (Canceled) 
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33. (Previously Presented) A system for comparing two or more documents, 
comprising: 

a document inputter, arranged to access two or more documents; 

a semantic analyzer, arranged to perform a linguistic analysis on each document; 

a semantic vector generator, arranged to output a semantic vector associated with 
each document; and 

a comparator, arranged to compare the semantic vectors using a defined metric, 
wherein said defined metric is one of: 

[Sqrt(fl 2 + f2 2 + O 2 + f4 2 + + f(N-l) 2 fN 2 )/n] * 100, wherein f is a 
difference in frequency of a common term between two documents and n is the number 
of terms those documents have in common; or 

Sqrt(sum((w-Delta) A 2 * w-Avg))/(Log(n) A 3 * 1000), wherein w-Delta is 
the difference in weight between two common terms, w-Avg is the average weight 
between two common terms, and n is the number of common terms, between two 
documents. 



Applicant : Kas Kasravi, et al. Attorney Docket No.: 14012-140001/50-02-032 

Serial No. : 10/766,308 

Filed : January 27, 2004 

Page : 12 of 18 



34. (Previously Presented) A method of comparing two or more documents, 
comprising: 

linguistically analyzing two or more documents; 
generating a semantic vector associated with each document; and 
comparing the semantic vectors using a defined metric, wherein said defined 
metric is one of: 

[Sqrt(fl 2 + f2 2 + fi 2 + f4 2 + + f(N-l) 2 fN 2 )/n] * 100, wherein f is a 
difference in frequency of a common term between two documents and n is the number 
of terms those documents have in common; or 

Sqrt(sum((w-Delta) A 2 * w-Avg))/(Log(n) A 3 * 1000), wherein W-Delta is 
the difference in weight between two common terms, w-Avg is the average weight 
between two common terms, and n is the number of common terms, between two 
documents. 
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35. (Previously Presented) A computer program product comprising a computer 
usable medium having computer readable program code means embodied therein, the 
computer readable program code means in said computer program product comprising 
means for causing a computer to: 

access two or more documents; 

perform a linguistic analysis on each document; 

output a quantified representation of a semantic content of each document; and 
compare the quantified representations using a defined algorithm, wherein said 

defined algorithm is one of: 

[Sqrt(fl 2 + f2 2 + O 2 + f4 2 + + f(N-l) 2 fN 2 )/n] * 100, wherein f is a 

difference in frequency of a common term between two documents and n is the number 

of terms those documents have in common; or 

Sqrt(sum((w-Delta) A 2 * w-Avg))/(Log(n) A 3 * 1000), wherein w-Delta is 

the difference in weight between two common terms, w-Avg is the average weight 

between two common terms, and n is the number of common terms, between two 

documents. 

36. (New) The method of claim 1, wherein said term comprises at least one of a 
word or a phrase. 

37. (New) The method of claim 1, further comprising comparing the semantic 
vectors based on a defined algorithm. 

38. (New) The method of claim 14, wherein the at least one subordinate term 
includes synonyms of one of the subordinate terms. 



39. (New) The method of claim 14, wherein one or more of the at least one 



subordinate term or the main term comprises a phrase. 
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40. (New) The method of claim 14, wherein the weighting factor comprises a 
plurality of different weighting factors and each of the different weighting factors 
relates to the importance of the main term or a subordinate term in the term group. 



